Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 2356284 |
| Missing cells | 11809696 |
| Missing cells (%) | 35.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 251.7 MiB |
| Average record size in memory | 112.0 B |
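The derived figures in this table can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables times observations) and that the average record size is the total memory divided by the number of observations; the variable names are illustrative only.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 14
n_observations = 2_356_284
missing_cells = 11_809_696
total_memory_mib = 251.7

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024**2 bytes, averaged over all observations.
avg_record_bytes = total_memory_mib * 1024**2 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")               # Missing cells: 35.8%
print(f"Average record size: {avg_record_bytes:.1f} B")   # Average record size: 112.0 B
```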
Variable types
| CAT | 7 |
|---|---|
| NUM | 7 |
Warnings
| Warning | Type |
|---|---|
| country_region_code has a high cardinality: 134 distinct values | High cardinality |
| country_region has a high cardinality: 135 distinct values | High cardinality |
| sub_region_1 has a high cardinality: 1855 distinct values | High cardinality |
| sub_region_2 has a high cardinality: 9682 distinct values | High cardinality |
| metro_area has a high cardinality: 65 distinct values | High cardinality |
| iso_3166_2_code has a high cardinality: 2143 distinct values | High cardinality |
| date has a high cardinality: 203 distinct values | High cardinality |
| sub_region_1 has 40330 (1.7%) missing values | Missing |
| sub_region_2 has 401840 (17.1%) missing values | Missing |
| metro_area has 2343224 (99.4%) missing values | Missing |
| iso_3166_2_code has 1942157 (82.4%) missing values | Missing |
| census_fips_code has 1828595 (77.6%) missing values | Missing |
| retail_and_recreation_percent_change_from_baseline has 825720 (35.0%) missing values | Missing |
| grocery_and_pharmacy_percent_change_from_baseline has 854377 (36.3%) missing values | Missing |
| parks_percent_change_from_baseline has 1210576 (51.4%) missing values | Missing |
| transit_stations_percent_change_from_baseline has 1130892 (48.0%) missing values | Missing |
| workplaces_percent_change_from_baseline has 109641 (4.7%) missing values | Missing |
| residential_percent_change_from_baseline has 1120763 (47.6%) missing values | Missing |
| metro_area is uniformly distributed | Uniform |
| retail_and_recreation_percent_change_from_baseline has 25590 (1.1%) zeros | Zeros |
| grocery_and_pharmacy_percent_change_from_baseline has 40653 (1.7%) zeros | Zeros |
| workplaces_percent_change_from_baseline has 40157 (1.7%) zeros | Zeros |
| residential_percent_change_from_baseline has 65648 (2.8%) zeros | Zeros |
Reproduction
| Analysis started | 2020-10-03 05:35:35.965433 |
|---|---|
| Analysis finished | 2020-10-03 05:37:07.701105 |
| Duration | 1 minute and 31.74 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
country_region_code
| Distinct | 134 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1581 |
| Missing (%) | 0.1% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| US | 538042 | 22.8% | |
| BR | 372159 | 15.8% | |
| IN | 134996 | 5.7% | |
| TR | 106829 | 4.5% | |
| GB | 84876 | 3.6% | |
| AR | 83788 | 3.6% | |
| PL | 77298 | 3.3% | |
| NL | 71721 | 3.0% | |
| CO | 66368 | 2.8% | |
| AU | 54819 | 2.3% | |
| Other values (124) | 763807 | 32.4% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.000670972 |
| Min length | 2 |
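A maximum length of 3 and a mean length slightly above 2 look odd for two-letter country codes. One possible reading, which is an assumption and not something the report states, is that the 1581 missing values are measured as a three-character placeholder such as `nan`. The arithmetic below shows that this assumption reproduces the reported mean length; the variable names are illustrative.

```python
# Values copied from the tables above.
n_rows = 2_356_284
n_missing = 1_581

# Assumption: every present code has 2 characters and every missing value
# is measured as the 3-character placeholder "nan".
code_length = 2
placeholder_length = len("nan")

mean_length = (
    code_length * (n_rows - n_missing) + placeholder_length * n_missing
) / n_rows

print(f"{mean_length:.9f}")  # compare with the reported 2.000670972
```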
country_region
| Distinct | 135 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| United States | 538042 | 22.8% | |
| Brazil | 372159 | 15.8% | |
| India | 134996 | 5.7% | |
| Turkey | 106829 | 4.5% | |
| United Kingdom | 84876 | 3.6% | |
| Argentina | 83788 | 3.6% | |
| Poland | 77298 | 3.3% | |
| Netherlands | 71721 | 3.0% | |
| Colombia | 66368 | 2.8% | |
| Australia | 54819 | 2.3% | |
| Other values (125) | 765388 | 32.5% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 22 |
|---|---|
| Median length | 7 |
| Mean length | 8.561637731 |
| Min length | 4 |
sub_region_1
| Distinct | 1855 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 40330 |
| Missing (%) | 1.7% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| State of São Paulo | 70288 | 3.0% | |
| State of Minas Gerais | 47607 | 2.0% | |
| Texas | 40059 | 1.7% | |
| State of Rio Grande do Sul | 29212 | 1.2% | |
| Georgia | 28321 | 1.2% | |
| State of Paraná | 27964 | 1.2% | |
| State of Bahia | 25873 | 1.1% | |
| Virginia | 25156 | 1.1% | |
| Buenos Aires Province | 25114 | 1.1% | |
| State of Santa Catarina | 23461 | 1.0% | |
| Other values (1845) | 1972899 | 83.7% | |
| (Missing) | 40330 | 1.7% |
Frequencies of value counts
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 74 |
|---|---|
| Median length | 11 |
| Mean length | 12.22135999 |
| Min length | 3 |
sub_region_2
| Distinct | 9682 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 401840 |
| Missing (%) | 17.1% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| Washington County | 5623 | 0.2% | |
| Jefferson County | 4664 | 0.2% | |
| Franklin County | 4382 | 0.2% | |
| Jackson County | 4019 | 0.2% | |
| Lincoln County | 3848 | 0.2% | |
| Madison County | 3688 | 0.2% | |
| Montgomery County | 3419 | 0.1% | |
| Marion County | 3245 | 0.1% | |
| Union County | 3128 | 0.1% | |
| Monroe County | 3121 | 0.1% | |
| Other values (9672) | 1915307 | 81.3% | |
| (Missing) | 401840 | 17.1% |
Frequencies of value counts
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 56 |
|---|---|
| Median length | 11 |
| Mean length | 11.40147453 |
| Min length | 2 |
metro_area
| Distinct | 65 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2343224 |
| Missing (%) | 99.4% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| Lahore Metropolitan Area | 203 | < 0.1% | |
| Montevideo Metropolitan Area | 203 | < 0.1% | |
| Kuwait City Metropolitan Area | 203 | < 0.1% | |
| Doha Metropolitan Area | 203 | < 0.1% | |
| Bamako Metropolitan Area | 203 | < 0.1% | |
| Ufa Metropolitan Area | 203 | < 0.1% | |
| Marrakesh Metropolitan Area | 203 | < 0.1% | |
| Rostov-on-Don Metropolitan Area | 203 | < 0.1% | |
| Baguio Metropolitan Area | 203 | < 0.1% | |
| Quetta Metropolitan Area | 203 | < 0.1% | |
| Other values (55) | 11030 | 0.5% | |
| (Missing) | 2343224 | 99.4% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 34 |
|---|---|
| Median length | 3 |
| Mean length | 3.125661423 |
| Min length | 3 |
iso_3166_2_code
| Distinct | 2143 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 1942157 |
| Missing (%) | 82.4% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| RO-BT | 203 | < 0.1% | |
| CI-09 | 203 | < 0.1% | |
| LT-TE | 203 | < 0.1% | |
| GB-BGE | 203 | < 0.1% | |
| GB-HRY | 203 | < 0.1% | |
| GT-GU | 203 | < 0.1% | |
| IT-78 | 203 | < 0.1% | |
| NI-CA | 203 | < 0.1% | |
| GB-STT | 203 | < 0.1% | |
| PL-WN | 203 | < 0.1% | |
| Other values (2133) | 412097 | 17.5% | |
| (Missing) | 1942157 | 82.4% |
Frequencies of value counts
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.379635901 |
| Min length | 3 |
census_fips_code
| Distinct | 2833 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 1828595 |
| Missing (%) | 77.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30338.94115 |
|---|---|
| Minimum | 1001 |
| Maximum | 56045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 5099 |
| Q1 | 18109 |
| median | 29105 |
| Q3 | 45057 |
| 95-th percentile | 53075 |
| Maximum | 56045 |
| Range | 55044 |
| Interquartile range (IQR) | 26948 |
Descriptive statistics
| Standard deviation | 15296.7533 |
|---|---|
| Coefficient of variation (CV) | 0.5041953582 |
| Kurtosis | -1.128529214 |
| Mean | 30338.94115 |
| Median Absolute Deviation (MAD) | 12000 |
| Skewness | -0.06710196046 |
| Sum | 1.600952552e+10 |
| Variance | 233990661.6 |
| Monotonicity | Not monotonic |
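Several of the descriptive statistics above follow directly from others in the same tables. As a rough check, assuming the usual definitions (range is maximum minus minimum, IQR is Q3 minus Q1, CV is standard deviation divided by mean, variance is standard deviation squared), the reported values can be reproduced as follows; the variable names are illustrative.

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 1001, 56045
q1, q3 = 18109, 45057
mean = 30338.94115
std_dev = 15296.7533

value_range = maximum - minimum   # reported as 55044
iqr = q3 - q1                     # reported as 26948
cv = std_dev / mean               # reported as 0.5041953582
variance = std_dev ** 2           # reported as 233990661.6

print(value_range, iqr)
print(f"{cv:.6f}", f"{variance:.1f}")
```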
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 25011 | 203 | < 0.1% | |
| 25025 | 203 | < 0.1% | |
| 25023 | 203 | < 0.1% | |
| 25021 | 203 | < 0.1% | |
| 25017 | 203 | < 0.1% | |
| 25015 | 203 | < 0.1% | |
| 25013 | 203 | < 0.1% | |
| 25009 | 203 | < 0.1% | |
| 26005 | 203 | < 0.1% | |
| 25005 | 203 | < 0.1% | |
| Other values (2823) | 525659 | 22.3% | |
| (Missing) | 1828595 | 77.6% |
| Value | Count | Frequency (%) | |
| 1001 | 203 | < 0.1% | |
| 1003 | 203 | < 0.1% | |
| 1005 | 203 | < 0.1% | |
| 1007 | 203 | < 0.1% | |
| 1009 | 203 | < 0.1% |
| Value | Count | Frequency (%) | |
| 56045 | 147 | < 0.1% | |
| 56043 | 153 | < 0.1% | |
| 56041 | 203 | < 0.1% | |
| 56039 | 203 | < 0.1% | |
| 56037 | 203 | < 0.1% |
date
| Distinct | 203 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
| Value | Count | Frequency (%) | |
| 2020-07-09 | 12896 | 0.5% | |
| 2020-06-23 | 12893 | 0.5% | |
| 2020-07-16 | 12891 | 0.5% | |
| 2020-06-18 | 12891 | 0.5% | |
| 2020-07-30 | 12891 | 0.5% | |
| 2020-07-02 | 12891 | 0.5% | |
| 2020-06-24 | 12891 | 0.5% | |
| 2020-06-17 | 12890 | 0.5% | |
| 2020-07-03 | 12890 | 0.5% | |
| 2020-07-07 | 12888 | 0.5% | |
| Other values (193) | 2227372 | 94.5% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
retail_and_recreation_percent_change_from_baseline
| Distinct | 422 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 825720 |
| Missing (%) | 35.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -24.40305012 |
|---|---|
| Minimum | -100 |
| Maximum | 545 |
| Zeros | 25590 |
| Zeros (%) | 1.1% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -79 |
| Q1 | -48 |
| median | -20 |
| Q3 | -1 |
| 95-th percentile | 18 |
| Maximum | 545 |
| Range | 645 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 31.56306826 |
|---|---|
| Coefficient of variation (CV) | -1.293406689 |
| Kurtosis | 2.430111479 |
| Mean | -24.40305012 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 0.1635129616 |
| Sum | -37350430 |
| Variance | 996.2272777 |
| Monotonicity | Not monotonic |
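The mean and the negative coefficient of variation reported above can be traced back to the sum and the missing count. The sketch assumes that the mean is taken over non-missing observations only and that CV is the standard deviation divided by the mean, which is why a negative mean yields a negative CV. The variable names are illustrative.

```python
# Values copied from the tables above; n_observations comes from the
# dataset statistics at the top of the report.
n_observations = 2_356_284
n_missing = 825_720
total_sum = -37_350_430
std_dev = 31.56306826

# Assumption: missing values are excluded before averaging.
n_valid = n_observations - n_missing
mean = total_sum / n_valid        # reported as -24.40305012

# The sign of CV follows the sign of the mean.
cv = std_dev / mean               # reported as -1.293406689

print(n_valid)
print(f"{mean:.6f}", f"{cv:.6f}")
```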
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 25590 | 1.1% | |
| -2 | 25170 | 1.1% | |
| -1 | 25140 | 1.1% | |
| 1 | 24823 | 1.1% | |
| 2 | 24621 | 1.0% | |
| -3 | 24544 | 1.0% | |
| -4 | 23956 | 1.0% | |
| -5 | 23327 | 1.0% | |
| 3 | 23223 | 1.0% | |
| -6 | 22659 | 1.0% | |
| Other values (412) | 1287511 | 54.6% | |
| (Missing) | 825720 | 35.0% |
| Value | Count | Frequency (%) | |
| -100 | 171 | < 0.1% | |
| -99 | 80 | < 0.1% | |
| -98 | 428 | < 0.1% | |
| -97 | 873 | < 0.1% | |
| -96 | 1392 | 0.1% |
| Value | Count | Frequency (%) | |
| 545 | 1 | < 0.1% | |
| 533 | 1 | < 0.1% | |
| 527 | 2 | < 0.1% | |
| 524 | 1 | < 0.1% | |
| 513 | 1 | < 0.1% |
grocery_and_pharmacy_percent_change_from_baseline
| Distinct | 443 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 854377 |
| Missing (%) | 36.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -5.039780093 |
|---|---|
| Minimum | -100 |
| Maximum | 454 |
| Zeros | 40653 |
| Zeros (%) | 1.7% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -50 |
| Q1 | -16 |
| median | -2 |
| Q3 | 8 |
| 95-th percentile | 29 |
| Maximum | 454 |
| Range | 554 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 25.18208074 |
|---|---|
| Coefficient of variation (CV) | -4.996662608 |
| Kurtosis | 6.786753458 |
| Mean | -5.039780093 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.1762826237 |
| Sum | -7569281 |
| Variance | 634.1371905 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2 | 40943 | 1.7% | |
| 0 | 40653 | 1.7% | |
| 1 | 40313 | 1.7% | |
| 3 | 39959 | 1.7% | |
| 4 | 38839 | 1.6% | |
| -1 | 38644 | 1.6% | |
| -2 | 37516 | 1.6% | |
| 5 | 36165 | 1.5% | |
| -3 | 35165 | 1.5% | |
| 6 | 34710 | 1.5% | |
| Other values (433) | 1119000 | 47.5% | |
| (Missing) | 854377 | 36.3% |
| Value | Count | Frequency (%) | |
| -100 | 76 | < 0.1% | |
| -99 | 1 | < 0.1% | |
| -98 | 43 | < 0.1% | |
| -97 | 230 | < 0.1% | |
| -96 | 547 | < 0.1% |
| Value | Count | Frequency (%) | |
| 454 | 1 | < 0.1% | |
| 400 | 1 | < 0.1% | |
| 387 | 1 | < 0.1% | |
| 385 | 1 | < 0.1% | |
| 382 | 1 | < 0.1% |
parks_percent_change_from_baseline
| Distinct | 904 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1210576 |
| Missing (%) | 51.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -6.345571472 |
|---|---|
| Minimum | -100 |
| Maximum | 1206 |
| Zeros | 10462 |
| Zeros (%) | 0.4% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -82 |
| Q1 | -49 |
| median | -16 |
| Q3 | 16 |
| 95-th percentile | 113 |
| Maximum | 1206 |
| Range | 1306 |
| Interquartile range (IQR) | 65 |
Descriptive statistics
| Standard deviation | 65.94798252 |
|---|---|
| Coefficient of variation (CV) | -10.39275703 |
| Kurtosis | 13.63809997 |
| Mean | -6.345571472 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | 2.484969709 |
| Sum | -7270172 |
| Variance | 4349.136399 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| -2 | 10515 | 0.4% | |
| 0 | 10462 | 0.4% | |
| -1 | 10301 | 0.4% | |
| -4 | 10243 | 0.4% | |
| -3 | 10205 | 0.4% | |
| -7 | 10096 | 0.4% | |
| 2 | 10061 | 0.4% | |
| -6 | 10053 | 0.4% | |
| 1 | 10023 | 0.4% | |
| -5 | 9937 | 0.4% | |
| Other values (894) | 1043812 | 44.3% | |
| (Missing) | 1210576 | 51.4% |
| Value | Count | Frequency (%) | |
| -100 | 3440 | 0.1% | |
| -99 | 268 | < 0.1% | |
| -98 | 748 | < 0.1% | |
| -97 | 998 | < 0.1% | |
| -96 | 1402 | 0.1% |
| Value | Count | Frequency (%) | |
| 1206 | 1 | < 0.1% | |
| 1187 | 1 | < 0.1% | |
| 1150 | 1 | < 0.1% | |
| 1149 | 2 | < 0.1% | |
| 1146 | 1 | < 0.1% |
transit_stations_percent_change_from_baseline
| Distinct | 481 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1130892 |
| Missing (%) | 48.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -28.74046101 |
|---|---|
| Minimum | -100 |
| Maximum | 497 |
| Zeros | 14325 |
| Zeros (%) | 0.6% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -77 |
| Q1 | -52 |
| median | -29 |
| Q3 | -6 |
| 95-th percentile | 18 |
| Maximum | 497 |
| Range | 597 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 31.57209016 |
|---|---|
| Coefficient of variation (CV) | -1.098524138 |
| Kurtosis | 4.832432845 |
| Mean | -28.74046101 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.7771535017 |
| Sum | -35218331 |
| Variance | 996.7968771 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 14325 | 0.6% | |
| -35 | 14070 | 0.6% | |
| -2 | 14049 | 0.6% | |
| -40 | 14030 | 0.6% | |
| -33 | 13967 | 0.6% | |
| -32 | 13866 | 0.6% | |
| -29 | 13859 | 0.6% | |
| -36 | 13836 | 0.6% | |
| -31 | 13818 | 0.6% | |
| -30 | 13807 | 0.6% | |
| Other values (471) | 1085765 | 46.1% | |
| (Missing) | 1130892 | 48.0% |
| Value | Count | Frequency (%) | |
| -100 | 1386 | 0.1% | |
| -99 | 1 | < 0.1% | |
| -98 | 66 | < 0.1% | |
| -97 | 228 | < 0.1% | |
| -96 | 457 | < 0.1% |
| Value | Count | Frequency (%) | |
| 497 | 1 | < 0.1% | |
| 485 | 1 | < 0.1% | |
| 462 | 1 | < 0.1% | |
| 461 | 1 | < 0.1% | |
| 459 | 1 | < 0.1% |
workplaces_percent_change_from_baseline
| Distinct | 282 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 109641 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -21.9506526 |
|---|---|
| Minimum | -100 |
| Maximum | 258 |
| Zeros | 40157 |
| Zeros (%) | 1.7% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -59 |
| Q1 | -35 |
| median | -22 |
| Q3 | -5 |
| 95-th percentile | 9 |
| Maximum | 258 |
| Range | 358 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 21.26001117 |
|---|---|
| Coefficient of variation (CV) | -0.9685366338 |
| Kurtosis | 0.2736910935 |
| Mean | -21.9506526 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | -0.2249839977 |
| Sum | -49315280 |
| Variance | 451.9880751 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| -24 | 45900 | 1.9% | |
| -25 | 45524 | 1.9% | |
| -26 | 45368 | 1.9% | |
| -27 | 45179 | 1.9% | |
| -28 | 44314 | 1.9% | |
| -23 | 44272 | 1.9% | |
| -22 | 43306 | 1.8% | |
| -29 | 42989 | 1.8% | |
| 1 | 42650 | 1.8% | |
| 2 | 42508 | 1.8% | |
| Other values (272) | 1804633 | 76.6% | |
| (Missing) | 109641 | 4.7% |
| Value | Count | Frequency (%) | |
| -100 | 2 | < 0.1% | |
| -95 | 2 | < 0.1% | |
| -94 | 11 | < 0.1% | |
| -93 | 21 | < 0.1% | |
| -92 | 69 | < 0.1% |
| Value | Count | Frequency (%) | |
| 258 | 1 | < 0.1% | |
| 248 | 1 | < 0.1% | |
| 246 | 1 | < 0.1% | |
| 241 | 3 | < 0.1% | |
| 239 | 1 | < 0.1% |
residential_percent_change_from_baseline
| Distinct | 97 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1120763 |
| Missing (%) | 47.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.27266392 |
|---|---|
| Minimum | -46 |
| Maximum | 57 |
| Zeros | 65648 |
| Zeros (%) | 2.8% |
| Memory size | 18.0 MiB |
Quantile statistics
| Minimum | -46 |
|---|---|
| 5-th percentile | -2 |
| Q1 | 3 |
| median | 9 |
| Q3 | 16 |
| 95-th percentile | 27 |
| Maximum | 57 |
| Range | 103 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.007779345 |
|---|---|
| Coefficient of variation (CV) | 0.8768688838 |
| Kurtosis | 0.1873143663 |
| Mean | 10.27266392 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.6592314183 |
| Sum | 12692092 |
| Variance | 81.14008873 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 65648 | 2.8% | |
| 1 | 58716 | 2.5% | |
| 10 | 52080 | 2.2% | |
| 9 | 51312 | 2.2% | |
| 11 | 51164 | 2.2% | |
| -1 | 50437 | 2.1% | |
| 8 | 49668 | 2.1% | |
| 7 | 49254 | 2.1% | |
| 12 | 49230 | 2.1% | |
| 6 | 49230 | 2.1% | |
| Other values (87) | 708782 | 30.1% | |
| (Missing) | 1120763 | 47.6% |
| Value | Count | Frequency (%) | |
| -46 | 2 | < 0.1% | |
| -45 | 1 | < 0.1% | |
| -43 | 1 | < 0.1% | |
| -40 | 1 | < 0.1% | |
| -39 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 57 | 1 | < 0.1% | |
| 56 | 2 | < 0.1% | |
| 55 | 5 | < 0.1% | |
| 54 | 12 | < 0.1% | |
| 53 | 13 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, where -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exactly linear relationship the steepness of the slope does not affect r; only its sign does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
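The calculation described above (covariance divided by the product of the standard deviations) can be sketched in plain Python. This is a minimal illustration with a hypothetical helper name, `pearson_r`, and not the code the report generator itself uses.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson's r: covariance of xs and ys over the product of their std devs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    # A constant input gives zero spread and raises ZeroDivisionError.
    return cov / (spread_x * spread_y)


x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = pearson_r(x, y)
# Shifting and positively rescaling y leaves r unchanged.
r_rescaled = pearson_r(x, [10 * v + 7 for v in y])
print(r, r_rescaled)
```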
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, where -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
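As a sketch of the description above, ρ can be computed by ranking both variables and applying the covariance-over-standard-deviations formula to the ranks. The helper names (`average_ranks`, `spearman_rho`) are hypothetical, and giving tied values their average rank is an assumption about tie handling that the text does not specify.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman's rho: Pearson correlation of the rank variables."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in rx))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in ry))
    return cov / (spread_x * spread_y)


# A monotonic but nonlinear relationship still gives rho = 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```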
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, where -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
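The pair-counting definition above can be written out directly. The sketch below implements exactly that formula (often called tau-a) with a hypothetical helper name, `kendall_tau_a`; libraries commonly apply a tie correction (tau-b) that this version omits, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 values")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(n), 2):
        # The sign of this product tells whether the pair moves together.
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in xs or ys; it counts as neither.
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Three pairs in total: two concordant, one discordant.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```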
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | country_region_code | country_region | sub_region_1 | sub_region_2 | metro_area | iso_3166_2_code | census_fips_code | date | retail_and_recreation_percent_change_from_baseline | grocery_and_pharmacy_percent_change_from_baseline | parks_percent_change_from_baseline | transit_stations_percent_change_from_baseline | workplaces_percent_change_from_baseline | residential_percent_change_from_baseline |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-15 | 0.0 | 4.0 | 5.0 | 0.0 | 2.0 | 1.0 |
| 1 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-16 | 1.0 | 4.0 | 4.0 | 1.0 | 2.0 | 1.0 |
| 2 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-17 | -1.0 | 1.0 | 5.0 | 1.0 | 2.0 | 1.0 |
| 3 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-18 | -2.0 | 1.0 | 5.0 | 0.0 | 2.0 | 1.0 |
| 4 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-19 | -2.0 | 0.0 | 4.0 | -1.0 | 2.0 | 1.0 |
| 5 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-20 | -2.0 | 1.0 | 6.0 | 1.0 | 1.0 | 1.0 |
| 6 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-21 | -3.0 | 2.0 | 6.0 | 0.0 | -1.0 | 1.0 |
| 7 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-22 | -2.0 | 2.0 | 4.0 | -2.0 | 3.0 | 1.0 |
| 8 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-23 | -1.0 | 3.0 | 3.0 | -1.0 | 4.0 | 1.0 |
| 9 | AE | United Arab Emirates | NaN | NaN | NaN | NaN | NaN | 2020-02-24 | -3.0 | 0.0 | 5.0 | -1.0 | 3.0 | 1.0 |
Last rows
| | country_region_code | country_region | sub_region_1 | sub_region_2 | metro_area | iso_3166_2_code | census_fips_code | date | retail_and_recreation_percent_change_from_baseline | grocery_and_pharmacy_percent_change_from_baseline | parks_percent_change_from_baseline | transit_stations_percent_change_from_baseline | workplaces_percent_change_from_baseline | residential_percent_change_from_baseline |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2356274 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-08-24 | NaN | NaN | NaN | NaN | -4.0 | NaN |
| 2356275 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-08-25 | NaN | NaN | NaN | NaN | 7.0 | NaN |
| 2356276 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-08-26 | NaN | NaN | NaN | NaN | 1.0 | NaN |
| 2356277 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-08-27 | NaN | NaN | NaN | NaN | 0.0 | NaN |
| 2356278 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-08-28 | NaN | NaN | NaN | NaN | -3.0 | NaN |
| 2356279 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-08-31 | NaN | NaN | NaN | NaN | -1.0 | NaN |
| 2356280 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-09-01 | NaN | NaN | NaN | NaN | -2.0 | NaN |
| 2356281 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-09-02 | NaN | NaN | NaN | NaN | 5.0 | NaN |
| 2356282 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-09-03 | NaN | NaN | NaN | NaN | 6.0 | NaN |
| 2356283 | ZW | Zimbabwe | Midlands Province | Kwekwe | NaN | NaN | NaN | 2020-09-04 | NaN | NaN | NaN | NaN | 2.0 | NaN |